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Abstract 

Consider an election between k candidates in which each voter votes 
randomly (but not necessarily independently) and suppose that there is 
a single candidate that every voter prefers (in the sense that each voter is 
more likely to vote for this special candidate than any other candidate). 
Suppose we have a voting rule that takes all of the votes and produces a 
single outcome and suppose that each individual voter has little effect on 
the outcome of the voting rule. If the voting rule is a weighted plurality, 
then we show that with high probability, the preferred candidate will win 
the election. Conversely, we show that this statement fails for all other 
reasonable voting rules. 

This result is an extension of one by Haggstrom, Kalai and Mossel, 
who proved the above in the case k = 2. 



1 Introduction 

For elections between two candidates, it is well known that voting rules in which 
every voter has a small effect are good rules in the sense that they "aggregate 
information well:" if every voter has a small bias towards the same candidate 
then that candidate will win with overwhelming probability. When voters vote 
independently, this fact was noted by Margulis @] and Russo [5], whose results 
were later strengthened by Kahn, Kalai and Linial [3] and by Talagrand [BJ. 

When the voters are not independent, the situation is more complicated. It 
is no longer true, then, that every reasonable voting rule aggregates well. In 
fact, [2] show that if we want the aggregation to hold for every distribution of 
the voters, then weighted majority functions are the only option. We extend 
their result to the non-binary case. 

The author would like to thank Elchanan Mossel for suggesting this problem 
and providing fruitful discussions. 
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2 Definitions and results 



In the introduction, we made a few allusions to "reasonable" voting rules. Let 
us now say precisely what that means: we will require that our voting rules do 
not have a built-in preference for any alternative. This is a common assumption, 
and its definition is standard (see, eg. pQ). In what follows, the notation [k] 
stands for the set {0, . . . , k - 1}. 

Definition 2.1. A function f : [k] n -> [k] is neutral if f(a(x)) = a(f(x)) for 
all x € [k\ n and all permutations a on [k], where o~(x)i = <j(xi). 

Note that in the case k = 2, a function is neutral if, and only if, it is anti- 
symmetric according to the definition in [2]. 

Example 2.2 

When k = 2 and n is odd, then the simple majority function (for which f(x) = 1 
if : Xi = 1} > : Xi = 0}) is neutral. On the other hand, if n is even then in 
order to fully specify the simple majority function, we need to say what happens 
in the case of a tie; the choice of tie-breaking rule will determine whether the 
resulting function is neutral. For example, if we define fix) = x\ for every tied 
configuration x, then / is neutral. On the other hand, if f(x) = 1 for every tied 
configuration x, then / is not neutral. 

The example can be extended to k > 3. In this case, consider the tie-breaking 
rule f(x) = Xi where i is the smallest possible number for which Xi is equal to 
one of the tied alternatives. This tie-breaking rule is neutral, and it is more 
natural than setting f(x) = x\ because it guarantees that the output of / is one 
of the tied alternatives. 

2.1 Weighted plurality functions 

Let us say precisely what we mean by a weighted plurality function. The defini- 
tion that we take here generalizes the definition from [5] of a weighted majority 
function. 

Definition 2.3. A function f : [k] n -> [fc] is a weighted plurality function if 
there exist weights W\ , . . . , w n e R>o such that Y,i w i - 1 an d f or oil a,b z [k], 
f(x) = a implies that 

i:xi=a i:xi=b 

Note that the above definition does not prescribe a particular behavior if a 
tie occurs between two alternatives. If the weights are chosen so that ties never 
occur, then the weighted plurality function is clearly neutral. Moreover, for any 
set of weights we can construct a neutral weighted plurality function with those 
weights by following the tie-breaking rule outlined in Example 12.21 
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2.2 The influence of a voter 



The final notion that we need before stating our result is a way to quantify the 
power of a single voter. When k = 2, the notion of effect is well-established 
and can be found, for example, in [2J. However, there does not seem to be 
a well-established way of quantifying the effect of voters for non-binary social 
choice functions. Here, we propose a definition that closely resembles the one 
used in [5] for binary functions. 

Definition 2.4. Let f be a function [k\ n ->■ [fc] and fix a probability distribution 
P on [k] n . The effect of voter i is 

e t (f, P) = £ P(f(X) = j\Xi = j) - P(f(X) = j\Xi * j), 
4=1 

where X is a random variable distributed according to P. 

Note that for the case k - 2, the preceding definition reduces to 

ei(f,P) = 2(P(f(X) = l\Xt = 1) - P(f(X) = l\X t = 0)), 

which is just twice the definition in 2 of a voter's effect. Also, the effect is 
closely related to the correlation between the voters and the outcome: 

p(f(x) - m - j) - p(f(x) - m * j) - 

P{Xi =j)P{Xi +j) 
>4Cov(l {/=j}) l {Xi=j} ) 

and so 

e 4 (/,P)>4^Cov(l {/=j} ,l {Xi=j} ). 
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Example 2.5 

The simplest example of et(f, P) is when P is a product measure (ie. the Xi are 
independent) and the function / does not depend on its zth coordinate; in that 
case, P(f(X) = j\Xi = j) = P(f(X) = j\X { * j) for all j and so ei (f,P) = 0. 
On the other hand, if P is a distribution such that X\ = X^ = ■■■ = X n with 
probability 1, and if / is a plurality function, then P(f(X) = j\Xi = j) = 1 for 
all j, while P(f(X) = j\Xi * j) = 0; hence, e,(/,P) = 1 for all i. 

For a less trivial example, suppose that the Xi are independent and uniformly 
distributed on [k]. Let / be an unweighted plurality function. Then the Central 
Limit Theorem implies that ej(/,P) = O(^) as n -» oo. 

On the other hand, suppose that / is still an unweighted plurality function 
and the Xi are independent, but now P(Xi = 1) > P(Xi = j) + S for some 
S > and all j * 1. Then Hoeffding's inequality implies that P{f{X) = l\Xi) > 
1 - 2 exp(-(5 2 n/4) for sufficiently large n, regardless of the value of Xi. In 
particular, this implies that ei(f,P) = 0(exp(-6 2 rt/4)). Compared to the case 
where the Xi are uniformly distributed, this demonstrates that ej(/, P) can 
depend strongly on P, even when P is restricted to being a product measure. 
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2.3 The main result 



Our main theorem is the following: 

Theorem 2.6. (a) For every 5 > and e > 0, there is a r > such that for 
every weighted plurality function f with weights Wi and every probability 
distribution P on [k] n , ife^f, P) < r and there is a set A c [n] such that 
Y,i uiiP(Xi = a) > Y,i WiP(Xi = b) +5 for all i 6 [n] , all a e A and all b £ A, 
then P(f(X) e A) > 1-e. 

(b) If f is not a weighted plurality function then there exists a probability 
distribution P on [k] n such that P(Xi = 2) > P(X t = 1) for all i e [n] but 
P(f(X) = 1) = 1 (and hence e t (f, P) = for all i ). 

We remark that the Theorem is constructive in the sense that we can give 
an algorithm (based on solving a linear program) which either constructs some 
weights Wi witnessing the fact that / is a weighted plurality, or a probability 
distribution P satisfying part (b). 

Parts (a) and (b) of Theorem l2. Gl are converse to one another in the following 
sense: under the hypothesis of small effects, part (a) says that if there is a gap 
between the popularity of the most popular alternatives A and the less popular 
alternatives A c then a weighted plurality function will choose an alternative 
in A. Part (b) shows that this property fails for every function that is not a 
weighted plurality. Note that part (a) has an important special case, which is 
closer to the statement of [2j: if P(Xi = a) > P(Xi = b) + S for all i e [n] and all 
b + a, then f(X) = a with high probability if the effects are small enough. 

The remainder of the paper is devoted to the proof of Theorem 12.61 

Proof of Theorem 1 2. 6\ (a). This part of the proof follows very closely the argu- 
ment in p.. Suppose that / is a weighted plurality function with weights Wi. 
The first step is to show that / is "correlated" in some sense with each voter: 
define = P{Xi = j) and let Wj be the (random) weight assigned to alternative 
j: Wj = T,i:x t = 3 Wi. Then 

n k 

E £ wi E 1 {f(x)= J } (i{jc,=j} - ph) 
i=i j=x 

= E I X>il{/po=j}l{x i= j} - £ l{/(x)=j}WiPij 

= E£™ i %(x H }l { x W} - £P(/ = j)EW r (1) 

1,3 3 

Now, let ctj = P{f = j) and set dj = <x,/ (T,ieA a i) f° r 3 ' e A and dj = otherwise. 
The first term of (H|) is just 

E X>il{/(XW}l{X 4=J -} = E £ Mf(X)= 3 } W 3 
1,3 3 

^E£l {/w=J - } £a i W i = £a i EW i (2) 
j 11 
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since the winning alternative always has at least as much weight as any con- 
vex combination of alternatives. Since mhij^A EWj > max^y^ KWj + S, we can 
plug © into ([1]) to obtain 

© > E & i E Wj - Y, a j^Wj 

3 j 

^ Y( & j- a j) s 

jeA 

= 8P(fiA). 

Recalling that a{f,P) > Cov(l^ =J -}, l{ Xz =j}), we have 

n A; 

SP(f M) < e E ^ E Mnxy-niMx^} - Pa) 

^ i 

T 
< - 

~ 4 

and so one direction of the theorem is proved once we take r small enough that 
e>r/(4(5). □ 

The proof of the second part of the theorem follows the idea of [2] , in that 
we use linear programming duality to find a witness for / being a weighted 
plurality function. However, the details of the proof are quite different, since [5] 
uses a well-known linear program (the fractional vertex cover of a hypergraph) 
which does not extend beyond k = 2. 

The proof idea is this: we will write down a linear program and its dual. If 
the primal program has a large enough value it will turn out that / is a weighted 
plurality function. Otherwise, the dual has a small value and the dual variables 
witness the claim of Theorem l2.6l (b). In particular, note that this proof provides 
the algorithm that we mentioned after the statement of Theorem 12.61 

First we make a trivial observation that will simplify our linear program 
considerably: if a function is neutral, it is easier to check whether it is a weighted 
plurality because it is not necessary to try all possible combinations of a, b € [k]: 

Proposition 2.7. Suppose f : [fc]" -> [fc] is neutral. Then f is a weighted 
plurality if and only if there exist weights wi, . . . ,w n e R such that f(x) = 1 
implies that 

E wi > e w >- 

i:xi = l i:xi=2 

We can write a linear program for checking whether a given neutral function 
/ is a weighted plurality. The variables for this program are t; Wi for each 
i € [n] and g x for each x 6 [k] n for which f(x) = 1. In standard form, the primal 
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program is the following: 
maximize t+ - t_ 

subject to g x > for all x e [k] n such that f(x) = 1 
Wi > for all i e [n] 
t + >0 and t- > 

= 1 

i 

V Wi - V iw* ~ 5x ~ (t+ ~ i-) = for all x £ [fc]™ with /(x) = 1. 

i'.Xi=l i\Xi=2 

Proposition 2.8. Lett* be the value of the above linear program. Ift* > then 
f is a weighted plurality function. 

Proof. Let Wi, g x , t+ and t- be feasible points such that t+-t- > 0. Then, for 
all x with f(x) = 1, 

£ Wi- Y, W i = 9x + (t+ -t-) > 

i:Xi = l i\Xi=2 

and so / satisfies the conditions of Proposition ^. 71 □ 

Now consider the dual program; since the primal is in standard form, the 
dual is easy to write down. Let the dual variables be a and q x for all x such 
that f(x) = 1. Then the dual program is: 

minimize a + - a_ 
subject to 2J q x <-l 

x:f(x) = l 

Y, Ul^i} " 1 {x t =2})lx + (a+ - a-) > for all i e [n] 

x:f(x) = l 

q x < for all a; such that /(x) = 1 
a + < and a_ < 0. 

Proposition 2.9. Let a* be the value of the above dual program. If a* < then 
there exists a probability distribution on [k] n such that P(Xi = 2) > P(Xi = 1) 
for all i but f(X) = 1 almost surely. 

Proof. Choose a feasible point with a+ - a_ < and define p x = -q x /(T.xQx)- 
Then p x > and Y^xPx = 1, so we can define a probability distribution by 
P(X = x) = p x when f(x) = 1 and P(X = x) = otherwise. Under this 
distribution, f(X) = 1 with probability 1. On the other hand, with a+ - a_ < 
the constraints of the dual program imply that 

Y 1 {xi = l\<lx > Y 1 {x i =2}qx 
x:f(x) = l x:f(x) = l 
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for all i. Thus, 



P(Xi = l)= Y, 1{««=1}Px< E 1 {x t =2}Px = P(Xi = 2) 
x:/(x)=l z:/(x)=l 

for all i. □ 



To conclude the proof of Theorem 12.61 note that both the primal and dual 
programs are feasible and bounded and so a* = t* . 
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